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LOAD BALANCING IN A NETWORK 

5 CROSS-REFERENCES TO RELATED APPLICATIONS 

The present application claims priority to U.S. Provisional Application No. 
60/188,142 filed March 10, 2000 (Attorney Docket No. 4835-US); and U.S. 
Patent Application No. 09/779,071 filed February 7, 2001 (Attorney Docket No. 
10 UDN0005), commonly owned, and hereby incorporated by reference for all 
purposes. 

BACKGROUND OF THE INVENTION 
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TECHNICAL FIELD ^ 



The invention relates to network traffic management in a computer environment. 
More particularly, the invention relates to load balancing and network traffic 
20 management among Web servers or caches in a computer environment. 

DESCRIPTION OF THE PRIOR ART 

There are a number of load balancing and traffic management products available 
25 for performing load balancing/traffic management among a cluster of web 
servers. However, none have the ability to schedule both HTTP and HTTPS 
(SSL) traffic persistently based on user sessions (cookies or session identifiers in 
HTTP GET requests). 

30 The majority of the approaches suffer from having a bottleneck because they 
require all traffic coming in and out of the cluster to go through a single machine. 
The single machine limits the amount of throughput available in the cluster. 
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Additionally, many approaches lack scheduling features that make efficient use 
of web server caches. None can fully utilize HTTP keepalive connections when 
performing URL based scheduling across multiple machines. 

Further, none of the approaches are integrated with a global content delivery 
network and therefore cannot automatically determine when to use the content 
delivery network instead of the local cluster to deliver content. 

The previous solutions to load balancing and traffic management are almost 
universally based on TCP/IP routing and by changing routes when servers a,l. 
Routing can be performed based on the type of content or eiements w,.h,n 

content. 

Load balancers come in two basic flavors. There are those where al, .raffle in 
and ou, of the site goes through a single box. This has -four way" data flow. 
Data flows in through a scheduler, then to a Web server, then back through the 
scheduler and out to the client. 

There are also ones that have -triangular data flow." In these, the data flows 
through the scheduler, to a Web sen,er and then back ou. to the Interne. d,reo.,y 
from the Web server. 

The former solution does no. require special software on the servers in the 
Custer. The latter solution requires special software on the server in the cluster. 
Because of this, both have advantages and disadvantages. 

Triangular data flow has the advantage that the throughput of the system can be 
better since traffic going out doesnt go through .he scheduler box. However, r. ,s 
30 difficult to implement, maintain, and administer because of the requirement of 
putting software on each server that sends traffic directly out instead of gorng 
through the scheduler. 
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The four way data flow suffers from a throughput bottleneck, but isn't hard to 
implement because a box is placed in the network, configured, and it does its 
work. This approach does not require any software on the servers in the cluster. 

5 Persistence is the ability to keep an individual user session tied to a single 
machine. Almost all load balancers have various policies for scheduling and 
maintaining persistence. All packets from the individual user will be sent to the 
machine that he is persistent with. That way, a machine can maintain the state 
of the user since the user is always scheduled to the same machine. 

10 

Most load balancing systems allow scheduling based on information about the 
client (IP address) or content contained in a request (cookie, content requests, 
etc.). However, since these systems are normally based on simple routing 
techniques, they tend to fail when it comes to dealing with requests that are 
1 5 encrypted since they do not have the ability to decrypt the request. 

It would be advantageous to provide a decrypting load balancing array system 
that provides load balancing and network management across Web servers and 
bypasses the single server bottleneck in a computer network. It would further be 
20 advantageous to provide a decrypting load balancing array system that decrypts 
SSL requests and performs SSL session scheduling. 

SUMMARY OF THE INVENTION 

25 

The invention provides a decrypting load balancing array system. The system 
provides load balancing and network management of traffic through a cluster of 
servers that avoids the typical single server bottleneck. In addition, the invention 
allows the decryption/encryption and session scheduling of SSL requests through 
30 independent load balancing servers. 

A preferred embodiment of the invention uses a Pentaflow approach to network 
traffic management. Pentaflow extends across an array of Decrypting Load 

3 
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Balancing Array (DLBA) severs sitting in front of back end Web servers. One of 
the DLBA servers acts as a scheduler for the array. All incoming requests are 
routed through the scheduler. 

The scheduler routes and load balances the traffic to the other DLBA servers 
(including itself) in the array. Each DLBA server routes and load balances the 
incoming request packets to the appropriate back end Web servers. Responses 
,o the requests from the back end Web servers are sent back to the DLBA server 
which forwards the response directly to the requesting client. 

Each DLBA server has the ability to decrypt SSL sessions in a distributed fashion 
and then schedule sessions to back end Web servers based on a cookie or 
session ID. SSL packets are decrypted in the DLBA server before being routed 
,o a back end Web server. This allows the DLBA server to schedule SSL 
sessions to back end Web servers based on a cookie or session ID. Response 
packets are encrypted by the DLBA server before being forwarded to the olrent. 

The invention also uses cookie injection to map a client to a specific back end 
Web server. In addition, any DLBA server in the array is capable of taking over 
20 the scheduler functionality in case of scheduler failure. 

URL based scheduling and hash scheduling of request packets with keepalive 
connections is easily performed due to the invention's architecture. 

Other aspects and advantages of the invention will become apparent from the 
following detailed description in combination with the accompanying drawrngs, 
illustrating, by way of example, the principles of the invention. 
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RR1EF DESCRIPTION OF T HF DRAWINGS 

Fig. 1 is a block schematic diagram of a preferred embodiment of the invention's 
Pentaflow packet flow according to the invention; 
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Fig. 2 is a block schematic diagram of an example of a preferred embodiment of 
the invention's Decrypting Load Balancing Array cluster according to the 
invention; 

5 

Fig. 3 is a block schematic diagram of a logical viewpoint of a preferred 
embodiment of the invention's Decrypting Load Balancing Array cluster 
according to the invention; 

10 Fig. 4 is a diagram of an example of a Client IP Hash Table according to the 
invention; 

Fig. 5 is a diagram of an example of a DLBA Server Status Table according to 
the invention; 

15 

Fig. 6 is a block schematic diagram of a DLBA server interface according to the 
invention; 

Fig. 7 is a diagram of an example of a Web Server Status Table according to the 
20 invention; 

Fig. 8 is a diagram of an example of a Summary Web Server Status Table 
according to the invention; and 

25 Fig. 9 is a block schematic diagram of a task-level viewpoint of a preferred 
embodiment of the invention according to the invention. 
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PPTflH F P pccr.mi»TIQN P«=thf INVENTION 

5 The invention is embodied in a decrypting load baiancing array system in a 
computer environment. A system according to the invention provides load 
balancing and network management of traffic through a cluster of servers 
avoids the typical single server bottleneck. In addition, the invention allows the 
decryption/encryption and session scheduling of SSL requests through 

1 o independent load balancing servers. 

The invention provides a Demoting Load Balancing Array (DLBA) sys.env A 
DLBA system is a group of machines that operate in parallel to perform SSL 
decryption and HTTP ,oad baiancing for a group of back end Wet , sen^K 
15 supports session ,D and cookie based persistence for both HTTPS and HTTP 
traffic to the back end Web servers. 

A DLBA system can be used to replace existing LocalDirector, F5 BIG-IP, 
Resonate Cen.ralDlspa.ch, e.c. hardware and software. The DLBA system a iso 
20 performs decryption, removing .he need for customers .0 set up or deploy 
servers or hardware to support SSL. 

High-level Benefits of the DLBA 
25 Some of the features of the DLBA are as follows: 

. Single virtual IP address for many Web servers. 

• Pentaflow - 5 way pipelined data flow. 

. Cookie based persistence/session id scheduling for both HTTPS (SSL) and 

30 HTTP. 

. URL based scheduling (including hash scheduling) with keepalive support. 
. Fully redundant schedulers for scheduler failure protection. 
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• Unique hash group based persistence tables to keep persistence tables 
manageable. 

The first item is supported by all local load balancers. All load balancers publish 



take traffic for that IP address. 

Referring to Fig. 1 , the Pentaflow feature is shown. The five-way pipelined data 
flow is a unique feature of the invention. What Pentaflow does is cause the flow 
10 of traffic coming from a client across the Internet or network 104 to be scheduled 
105 to a DLBA server A 101, DLBA server A 101 schedules the traffic 106 to 
DLBA server B 102. DLBA server B 102 load balances traffic 107 to Web server 
C 103. Web server C 103 responds 108 back to DLBA server B 102, which 
sends the response 109 back to the client 104. 



Pentaflow causes traffic in and out of the system to go through different 
machines. All traffic coming into the system will come in via route 1 105. 
However from route 1 105, the traffic could be scheduled to any DLBA server in 
the cluster. In Fig. 1 only one other DLBA server 102 is shown but, in practice, 
20 there will be an array of DLBA servers to which traffic may be scheduled to any 
one server. Here, the DLBA server 102 that the traffic is scheduled to make a 
load balancing decision to decide which back end Web server to communicate 
with. It then communicates with the Web server 103 and returns a response to 
the client. 



The invention can scale to almost any amount of throughput. The only limitation 
is the scheduler's ability to handle incoming traffic. Outgoing traffic is balanced 
among many DLBA servers. The servers utilize the full bandwidth possible for 
their connections because there is no bottleneck in the array. Each individual 
30 server passes connections through the switch at the maximum possible 
bandwidth. 
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,n Cisco's LocalDireotor, F5's BIG-.P, and many other load balancing solutions, 
all .raffle in and ou, o. the site is routed through a single box. That single box 
tha t al. of *e .raffle goes through quickly becomes the bottteneck to the en.,re 

system. 

Resonate's approach uses triangular data flow, where al, .raffle coming into .he 
sKe goes through one box .hen gets scheduled .o a server which responds to the 
client This approach requires kernel and application level software on each o. 
m. Web server boxes .o operate. The additional software causes .he system to 
be prone to failures, .o in.er.ere with the operation of the Web server, and to t» 
difficult to manage. 

Pentaflow has the advantage that incoming and outgoing .raffle takes different 
paths and does no. require any modiflcaflons .o me Web severs where the traffic 

15 is sent. 

The cooKe and session ID based persistence for HTTPS and HTTP is a major 
advantage of the invention that no other load balancing solution has. Because 
tbe invention uses Pentaflow, the system is able to decrypt SSL sess,ons 
distributed fashion and then schedule sessions to back end Web servers based 
on a cookie or session ID. 

Other load balancers support cookie based persistence (with their own method of 
scheduling, for HTTP bu, no, for SSL. The reason that Cher load « 
cannot support cookie based persistence for SSL is that .hey cannot decrypt me 
SSL session because they only have one scheduling box in the ,ncom,ng path of 
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traffic. 
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Because of Pentaflow, the scheduling of incoming .raffle is distributed across an 
array o. boxes, allowing the DLBA to perform decryption in a distributed fasNoa 
This feature is critical to e-commer=e sites. It allows them to afways schedule 
based on cookie/session ID regardless of me protocol a user is using to connect 
,o me site and regardless o, whether the user's IP changes during the sess,on. 
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The user will always be scheduled to the same machine. No other existing single 
solution can provide this. 

The DLBA also allows URL based scheduling, including hash scheduling, for 
5 scheduling URL requests to caches where a single connection can be scheduling 
to multiple back end machines and a keep-alive connection with the client can 
still be maintained. Many load balancers currently support basic URL based 
scheduling with their scheduling algorithms - DLBA URL based scheduling is 
different. 



A DLBA server establishes a connection with a client and the client keeps a 
connection alive with the DLBA server. When requests come in, the DLBA 
server schedules the requests to different back end Web servers via different 
connections and replies to the client using a single connection. 



No other load balancer can support this because they do not maintain a separate 
connection with the client. To perform URL based scheduling using these 
approaches, keep-alive support must be turned off on all the load balancing 
servers. Some servers (e.g., Microsoft IIS) do not support turning off keep-alive 
20 with HTTP connections, thereby causing their URL based scheduling to fail. 

Another advantage of the DLBA is that schedulers are fully redundant in case of 
failure. Any one of the boxes in the array can become a scheduler at any time in 
case the main scheduler fails. The scheduler is basically a simple router that 

25 persistently maps requests based on an internal table. Also, the scheduler boxes 
can also be DLBA servers and can schedule connections to themselves. The 
invention is different from many of the other load balancers where two load 
balancing boxes are purchased and one box just sits idle and only becomes 
active when the other fails. In a DLBA system, as long as a machine is 

30 functioning, it will actively server some function in the array. 

The DLBA server maintains persistent connections in all its paths when required 
and a special algorithm, hash group based persistence, is used to keep the 
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tables small and manageable. Without this feature, persistence tables would get 
huge failover could not perform predictably, and more connections would make 
the system less manageable. The hash group based persistence allows the 
system to handle any amount of traffic and the tables do not need to grow when 
5 the traffic grows, preventing them from becoming a bottleneck in the system. 

Summary of Advantages 

The DLBA supports any of the functionality supported by any other load balancer 
,0 but also has major feature advantages over other load balancers. Because of 
the Pentaflow architecture and despite all of its added features, the system gams 
more throughput as more DLBA servers are added. 

This is an attractive model to a company that sells product because, Instead of a 
1 5 customer buying a single box, the customer will buy more boxes as the,r Mho 
increases. The customer will see the value in purchasing additional boxes. 

in the future, the scheduler functionality could be burned into a router or switch 
for the highest routing performance and the DLBA servers themselves could 
20 become blades in a large router. 

Because the system can parse traffic on the way out, 'speederizinr/ content 
could be performed by the DLBA servers. URLs are rewritten so that they can be 
delivered via a content delivery network instead of all of the traffic being routed to 
25 a single Web server. The servers sample a portion of traffic, determine what 
content would be best served by changing their URLs to serve them from the 
Speedera Content Delivery Network and then change the URLs dynamically on 

the way out. The HTML page which was speederized could also be cached, ,f ,t 

is static, to gain higher performance so it did not need to be parsed each time on 
30 the way out. Additionally, client information can be added to the Speedera 

domain name to give the Speedera DNS tier information about the exact cl.ent 

machine making the DNS request. 
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These are just some of the features the invention could add to the system 
because it controls the entire flow of traffic from the DNS level to the HTTP 
delivery level and is able to parse or change things in any way along the flow. 

5 DLBA Scheduler Architecture 

All traffic into the array comes in through a box known as the DLBA scheduler. 
The DLBA scheduler is a DLBA server that has ARPed for the VIP address. 

10 With respect to Fig. 2, all DLBA servers 202, 203, 204 in the array 205 have the 
VIP address assigned on a loopback interface (or Ethernet alias if possible), but 
only one machine in the cluster 205 ARPs for the VIP, causing the switch 201 to 
send all traffic for the VIP to that machine. All machines have the VIP address 
on an interface (loopback or physical) so they can send traffic back out to the 

1 5 client from the VIP address and so they can take over as the scheduler in case of 
a scheduler failure. Any machine in the cluster 205 has the ability to become a 
scheduler at any time in case the current scheduler fails. 

The job of the DLBA scheduler is to route/load balance incoming traffic among 
20 the DLBA servers in the cluster in a persistent manner. It does this using routing 
by changing the incoming packet's MAC address to another machine in the 
cluster and then forwarding the packet, unchanged, back out through the switch. 
In this vein, the DLBA scheduler is, essentially, a router. 

25 The scheduler may also simply process the packet itself instead of forwarding it 
out to another machine in the cluster if the load balancing decision determines 
that the scheduler machine itself is the best machine to process the connection. 

The scheduler machine does not perform the three way handshake with the 
30 client, it simply forwards on all incoming packets to itself or another DLBA server 
in the cluster for processing. Those machines will respond directly back to the 
client. In this way, all traffic coming into the site comes in through the scheduler 
and all traffic going out is load balanced among the DLBA servers in the cluster. 
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Fig 2 depicts what Pentaflow scheduling looks like physically with a DLBA array 
205. The DLBA scheduler is the DLBA server 202 since all incoming traffic to the 
array 205 comes in through that server. 

5 Referring to Fig. 3, a more logical Pentaflow DLBA array diagram is shown. Fig. 

3 shows the same array configuration as Fig. 2, but more clearly shows the 

relationship of the DLBA scheduler 302 to the other machines in the array 303. 

The switch 301 sits logically above the array 303. All traffic goes through the 
1 o DLBA scheduler 302 which routes and load balances traffic throughout the array 

303. 

Configuration 

15 Configuration of the DLBA scheduler is done via a configuration file. The 
configuration file is pushed to the scheduler via a remote or local configurator! 

program. 

If the DLBA scheduler code is kernel code, it may be necessary to read the 
20 configuration file in user space and use an ioctl() to set values in the kernel for 
scheduling. Since the DLBA scheduler code should be fairly straightforward, a 
single ioctlQ should suffice. 
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Internal Operation 

Internally, the DLBA scheduler is essentially a router. Packets come in and the 
scheduler routes them by changing the MAC address of the packets and 
forwards the packets. With respect to Fig. 4, packets are forwarded accord.ng to 
a Client IP/Port to DLBA Server Hash Table 401 . 

Referring to Figs. 4 and 5 when the DLBA Server number 402 is found, the 
scheduler can consult the DLBA Server Status Table. 501 to find the MAC 
address 502 it should route the packet to. 
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When a packet comes in, the Client IP and ephemeral port of the packet are 
hashed to a bucket in the Client IP/Port to DLBA Server Hash Table 401. The 
table contains the DLBA server number 402 to route the connection to. The size 
5 of the hash table is configurable but typically only at the time the array is started. 
It normally is not changed on a running cluster. 

The incoming client IP and ephemeral port could be hashed as simply as this: 




10 bucket = (32 bit client IP address + 16 bit client port) % size of hash table 

How does the Client IP Hash Table get filled? Initially, the hash table is empty 
with a unsigned -1 set for each DLBA server #. When a client connection comes 
in, if the DLBA server number it hashes to is -1 , a load balancing decision will 
1 5 need to be made. The DLBA Server Status table is consulted to determine which 
DLBA server is best suited to receive the new connection. This can be as simple 
as finding the machine with the lowest # IP Groups assigned. 

Referring again to Figs. 4 and 5, the # IP Groups Assigned column 503 in the 
20 DLBA Server Status Table 501 shows how many IP groups (hash entries) have 
been assigned to this server. To evenly distribute traffic, each time the server is 
added as an entry into the Client IP Table 401, the # IP Groups 503 assigned 
value for that server should be incremented. The weight 51 1 can also be used to 
determine how many IP groups should be assigned to a server as well as the 
25 number of hits 505 or # open connections 506. 

When a DLBA server fails, the scheduler should assign all entries in the Client IP 
table that map to that server to unsigned -1 . This will cause the system to route 
around the failed box and assign new servers to the failed connections in a load 
30 balanced manner. 

Entries in the Client IP Hash Table cannot simply be reassigned to a server when 
it is added to the cluster. This is because there may be active connections that 
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routed and changing an entry in the table will cause the session to 
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,nstead it is best to wait until buckets are unused and when they become 
unused! reassign them to the new sen,er. Using this method, new servers 
gradually enter the cluster. 

With respect to Fig. 4, to determine when entries become unused, each entry in 
the Client IP Hash Table 401 has a Last Access in Seconds column 403. Th,s rs 
the time of the last access to the hash entry in seconds. Each time a packet 
comes into the scheduler and it hashes to a bucket, the buckef s Last Access in 
Seconds is updated to reflect the cunent time. 

If the scheduler hashes to a bucket with a very old last access time, it can 
reassign the bucket to a new server and make load balancing decisions. This rs 
how new DLBA servers added to the cluster get assigned hash entries in the 
table. The amount of time that it takes for an entry to timeout before ,t ,s 
reassigned is configurable in the configuration file. 

Referring again to Fig. 5, the DLBA Sewer Status Table 501 has a number of 
columns that need to be kept up to date on a running system. For example, the 
Avg. Resp. Time (average response time) 504, number of hits 505, number of 
open connections 506, and failed 507 columns. 

A status daemon runs on each DLBA server machine. The DLBA scheduler 
communicates with each status daemon via TCP/IP to get status information. 
The scheduler does not connect to all of the sewers at once, instead * loops 
through the list of servers, communicating with each one, closing the connects 
after each one responds. Any server that it cannot communicate with after a 
configurable number of retries is considered MM. There is a configurable delay 
after the status sweep of the DLBA sewers in me array. The time of the last 
sweep is available for remote monitoring to determine how stale the status data 
is in the table. 
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The DLBA Server Number 508, MAC Address 502, Physical IP Address 509, 
Enabled 51 0, and Weight 51 1 columns come from the configuration file. 

5 Scheduler Failover 

A failover daemon runs on each DLBA server machine and its job is to contact 
the scheduler at a regular interval to obtain a current client IP table and to see if 
the scheduler has failed. It determines failure of the scheduler by a simple ping 
10 test to the box to determine if the kernel is running properly. If the ping 
succeeds, the kernel is running and it is assumed that the machine is running. It 
does not use whether the current client IP table could be obtained as a 
determination of whether the scheduler is up since that may be prone to failure 
itself. 

15 

The client IP table can have a version number on the scheduler and on each 
server so the whole table does not need to be sent at each interval if it has not 
changed. Only the server numbers to hash buckets needs to be versioned and 
synchronized across the machines, the last access times do not need to be. 
20 When failover occurs, the last access time is set to the current time for all entries 
in the table. 

The DLBA Server Status Table does not need to be synchronized since it is 
created from the configuration file and each DLBA server is pushed the 
25 configuration file during configuration. The dynamic columns of the DLBA Server 
Status Table do not relate to existing sessions in process. 

A DLBA server determines the active scheduler by checking its ARP table for the 
VIP address. The ARP table contains the MAC address of the current scheduler. 
30 If that scheduler is determined to be down by the failover daemon, then the 
DLBA server ARPs for the VIP. 
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A number of machines could ARP for the VIP at the same time but only one of 
them will end up with the VIP address. As soon as a new scheduler is chosen, 
the failover daemon tests will no longer fail and the system will have chosen a 
new scheduler. Decoupling the determination of the scheduler from the actaal 
determinator (who has ARPed for the VIP) is tricky to synchronize. It may be 
necessary for a potential scheduler to contact all machines to make sure they 
don't want to become the active scheduler. If a potential scheduler contacts 
another potential scheduler that thought it should be the scheduler, they could 
determine who should be the scheduler by simply comparing their P hys,cal IP 
address (or other static quantity that is different between machines) and 
whichever machine's is larger will become the active scheduler. 

A ping test is used to determine if a scheduler is running to help avoid an ARP 
war ,f two machines have failover/synchronlzation daemons that cant 
communicate with whatever status method is used, they could end up fighfng 
over who is the active scheduler for a cluster. If a method other man p,ng . 
used that method could fail and both methods would fight with ARPs. Us,ng 
ping is a pretty safe method in that if it doesn't work, the machine isn't around but 
the ping check itself is not prone to failure. 



20 



Panic Mode 



i x 



A -panic mode" can be used in the system that will evenly distribute the Chant IP 
hash table among ail servers in the cluster in case a meltdown occurs and the ^ 
25 table becomes messed up for whatever reason. This will break all exist,ng 
sessions but will quickly bring the system back to a normal state. 

Using the Panic Mode clears the dynamic fields in the DLBA Server Status table 
to their defaults (the configuration fi.e defaults) and sets the current schedul.ng 
30 box statically until the problem can be diagnosed and repaired. 
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Monitoring 

An ioctIO is used to dump the DLBA server Status table and to dump the Client 
IP Hash table. A daemon is written that performs the ioctl() and writes it to the 
5 socket read-only. This can be connected to a user-interface. 

Prototype 

A prototype of the scheduler can be developed without DLBA servers. The 
10 prototype is similar to a standard TCP/IP routing based NAT load balancer. If 
Web servers are run on each machine instead of DLBA servers, the scheduler 
routes traffic to the servers and they respond directly to the client. When the 
Web servers fail, the scheduler routes around them. When the scheduler fails, 
another box picks up as the scheduler. 

15 

DLBA Server Architecture 

With respect to Fig. 6, a DLBA server 601 is essentially a proxy Web server. It is 
a standard application level Web server that takes requests from clients 603 and 
20 forwards them 604 to back end Web servers 602. The DLBA server 601 
processes the replies 605 from the back end Web servers 602 and sends the 
response directly back to the client 606. 

If incoming requests are SSL, the DLBA server 601 performs the key exchange 
25 and decrypts the request before forwarding it to the back end Web servers 602. 
The DLBA server 601 will also encrypt the response on the way out 606 for SSL 
sessions. 

The view of a DLBA server is that it accepts incoming connections to the VIP, 
30 processes them, and returns a reply from a source of the VIP. Packets to a 
DLBA server are routed by a DLBA scheduler. However, the DLBA proxy Web 
server process does not know that. 

17 



BNSDOCID: <WO 016989OA1_l_> 



PCT/US01/07574 
WO 01/69890 

The DLBA sewer Web process generally listens to ports 80 and 443 on the VIP 
address. When requests come in, the server simply connects with the dent 
and then reads the request It processes the requests by opening a socket to the 
back end Web server and forwards the request to the back end. 

5 The DLBA server then reads the reply from the back end and writes a packet to 
the client whenever it reads a packet from the back end Web server. The traffic 
simply flows through the DLBA sewer on the way out. On the way in, the request 
must be read, up until the end of each request (there may be multiple requests * 

,0 the incoming session is kept a!ive), before a connection is opened to the back 
end Web servers. However, the reply is processed as it is read. For a given 
session, one packet is read and written before the next packet is written to 
reduce the amount of buffering required by the DLBA server. 

15 The connection to the back end Web servers does not need to be kept alive but 
the connection to the client does (if the client supports keep alive connects), 
in fact it is better to not keep the back end connection alive, not for performance, 
but to' allow for advanced scheduling. Where this can become useful is t the 
DLBA server performs URL hash based or other URL parsing to schedule 

20 requests. 

Consider, for example, the case in which the DLBA server was a load balancer 
for a set of Web caches. An individual client might request a.html, b.grf, and c.grf 
. three elements that would constitute a full Web page. « *e DLBA server was 
25 using URL hashing to spread content requests consistently to back end Web 
caches (to keep each cache "hof), the three items would exist on drfferent 
machines. 

For example, when the DLBA server keeps a keep alrve connection to the client, 
30 it will: 

• read the first request for a.html 

. hash a.html to a back end Web cache 
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• open a socket and read a.html from the back end, writing it to the client as it 
reads it 

• close the socket to the back end Web cache 

• hash b.gif to a back end Web cache 

5 • open a socket and read b.gif from the back end, writing it to the client as it 
reads it 

• close the socket to the back end Web cache 

• hash c.gif to a back end Web cache 

• open a socket and read c.gif from the back end, writing it to the client as it 
1 0 reads it 

• close the socket to the back end Web cache 

Notice that the connection to the client is kept alive through the whole process 
even though three socket connections to the back end were performed. 

15 

Since the connections to the back end are all through a switch, performance 
across them is extremely high. Keeping the connection alive to the client 
reduces the number of packets on the Internet, thereby increasing the end user 
performance. 

20 

The DLBA server parses incoming requests and schedules them based on a 
scheduling algorithm. The two most popular algorithms are probably 
cookie/session id based scheduling and URL based scheduling. Cookie based 
scheduling will persistently map an individual user to a back end Web server 
25 regardless of whether they are using SSL or HTTP. The back end Web server 
only sees HTTP traffic because the DLBA server performs decryption. 

The most common URL based scheduling is hash based scheduling with either a 
consistent hash or a binary list that has the properties of a consistent hash. This 
30 is used when the system is a front end to an array of Web caches. The 
cookie/session ID based scheduling is used when the system is a front end to a 
set of application Web servers that keep state information based on a user's 
cookie. 
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Configuration 

Configuration of a DLBA server is performed by the same process as the 
5 configuration of a DLBA scheduler I. performed. There is a single configure.™ 
file that is in the same forma, as the network configuration file. The same 
software is used to push and commit new configuration files ,o all DLBA servers 
in the array (including the DLBA scheduler). 

10 Internal Operation 

internally, the DLBA sen,er scheduler is essenfially a proxy Web sen,er. It 
schedules requests to back end Web severs either by looKng at a coo,e v,ue 
or session ID in an HTTP get request or by hashing the •ncom.ng URL to a Web 
15 server number. 

„ me URL hash is used, men no state table needs to be maintained since the 

20 they are caches, the caches wil, simply reorganize. A consistent hash or 
something that has the properties o, a consistent hash (without usrng the 
lis J hash algorithm) shouid be used for URL hash based schedu„ng so 
^en servers are added and removed the caches stay as "hof as poss.ble. 

25 When cootie scheduling is used, no state table needs .o be maintained because 
the system uses an approach that keeps server mapping on the client s,de. Th,s 
is called "cookie injection". 



Cookie Injection 



30 



When cookie/session ID scheduling is implemented, cookie injection is used to 
map a user to a specific machine in .he cluster. This is a novel feature of *e 



DLBA. 
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If persistent cookie scheduling is enabled, the DLBA servers scan the headers of 
outgoing requests from the Web caches looking for the Set-Cookie header item 
that is issued by the Web server to initiate a session. The specific cookie name 
5 that starts a session is configurable. 

When the DLBA server sees a Web server initiate a Set-Cookie request, it injects 
another additional name/value pair into that set cookie that identifies the server 
that the Set-Cookie came from. For example, something like "SpServer=10" is 
10 injected as an additional cookie name/value pair in the Set-Cookie response to 
the client. 

This additional cookie has the same expiration time as the session identifier so 
persistence will be maintained as long as the session is maintained. 

15 

When requests come in, the DLBA server looks for the SpServer cookie. If the 
DLBA server finds the cookie, it will schedule to the server identified by the 
cookie value. Each server can have a unique integer key, defined in the 
configuration file, that the cookie value maps to. 

20 

Reviewing the process again, the DLBA server scans incoming requests for a 
SpServer cookie. If it finds one, it looks at the value and schedules the request 
to the machine represented by the value. The DLBA server scans headers from 
Web server responses for a Set-Cookie that is creating a cookie to start a 
25 session. When it finds one, it injects a SpServer cookie into the Set-Cookie reply 
with the same attributes as the session. 

The name of the cookie the system injects should be configurable in the 
configuration file. 

30 

By storing the cookie-to-server mapping on the client, the DLBA servers do not 
have to maintain a table that maps cookies to back end Web servers. Also, since 
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the mapping has the same expiration as the session, there isn't a problem where 
the mapping will go away or last longer than the session identifier will. 



Status Tables 

Referring to Fig. 7, each DLBA server maintains a server status table 701 of all 
the back end Web servers. This table 701 contains various statistics about the 
back end Web servers it is connecting to such as the Physical IP Address 702, 
Avg. Resp. Time 703, Number of Hfts 704, and Number of Open Connections 
10 705. 

When a DLBA server tries to connect to a back end Web server and can't 
connect, it will reroute the request to another machine. So, none of the servers 
need to actively probe the back end Web servers, when they can't schedule a 
15 connection, that's considered a failure and they can reroute the connects at 
that time. 

The DLBA scheduler maintains an overall statistics table for the cluster by 
connection to each DLBA server, getting its status table and adding its status to 
20 an overall status stable. 

With respect to Fig. 8, the Summary Web Sen/er Status Table 801 is maintained 
by the active DLBA scheduler. The Summary Status Table 801 does not need to 
be synchronized across machines since it is ]ust for monitoring and not for 
25 scheduling. The monitoring information includes the Physical IP Address 802, 
Number of Cookie Groups Assigned 803, Total Number of Hits 804, and Total 
Number of Open Connections 805. 

Using cookie iniection removes the previous complications of what happens 
30 when a server is added. When a server is added to the cluster, cookies will ,ust 
be injected with that new machine's identifier and the system doesn't have to 
worry about cookie hash groups or synchronizing state across machines, eto. 
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DLBA servers are able to load balance incoming requests to the back end Web 
servers in an intelligent fashion (open connections, machine load, SNMP 
information, response time) to distribute user sessions among the back end Web 
server machines. 



The DLBA servers should have the ability to perform SSL encryption and 
decryption in hardware (using ncipher or rainbow cards) to accelerate the 
encryption/decryption process. 

10 Monitoring 

Each DLBA server will be running a daemon that has a socket open that dumps 
the state information of the tables used for scheduling and status. This can be 
the same daemon that the DLBA scheduler uses to dump its state information 
15 and the information dumped by the DLBA server should be in the same format as 
the DLBA scheduler data. 

Task Description 

20 Referring to Fig, 9, the Rev Traffic Pkts module 901 receives incoming traffic 
packets into the DLBA scheduler or server. The Rev Traffic Pkts module 
distributes the packets depending on whether the destination of the packets are 
to a server, or a client (response) and whether the DLBA server is also the 
scheduler. 



If the DLBA server is also the scheduler, packets destined for a Web server are 
routed through the Route/Load Balance Pkts module 906. The Route/Load 
Balance Pkts module 906 performs the routing and load balancing of the packets 
to the DLBA servers (including itself) in the array using the Client IP Hash Table 
30 912. The Route/Load balance Pkts module 906 uses the Server Status Table 
911 to determine the MAC address to route the packet(s) to, as previously 
described. 



!■•.- ■■• : . 
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The Configure System module 903 accepts pushed configuration information 
which is used to establish the Client IP Hash Table 912 and Server Status Table 
911. 

Server status is obtained by the Get Server Status module 902 which updates 
the Server Status Table 912 on the acting scheduler. On the serve, side, the 
status of the resident sewer is sent to the inquiring scheduler via its Server 

Daemon 910. 

The Failover/Synch Daemon 904, as described above, periodically checks with 
the scheduler to determine if the scheduler is still alive. It determines the actrve 
scheduler by checking the ARP Table 91 3 for the MAC address of the scheduler. 
When the scheduler fails, the determination of which server becomes the new 
scheduler is taken upon by the Elect Scheduler module 905. The Elect 
Scheduler modules of the servers in the array negotiate to become the new 
scheduler. 

in the role of server, packets from the RCV Traffic Pkts module 901 are 
processed by the Process Connection module 907. The Process Connect™ 
module 907 takes the requests, routed by the scheduler, from clients and 
forwards them to back end Web sewers. SSL requests are decrypted by the 
Encrypt/Decrypt SSL module 908 before forwarding the requests to back end 
Web servers. 

Responses from the Web sewer come in through the Rev Traffic PWs module 
901 and are sent to the Forward Ftt to Client module 909. The Forward Pkt to 
Client module 909 forwards the responses directly to the client. SSL responses 
are encrypted by me Encrypt/Decrypt SSL module 908 before the response ,s 
sent directly to the client by the Forward Pkt to Client module 909. 

Although the invention is described herein with reference to the preferred 
embodiment, one skilled in the art will readily appreciate that other applicauons 
may be substituted for those set forth herein without departing from the spmt and 

24 



30 



BNSDOCID; <WO. 0169890A1_I_> 



WO 01/69890 PCT/US01/07574 

scope of the present invention. Accordingly, the invention should only be limited 
by the Claims included below. 
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CLAIMS 



! A process for routing packets through a load balancing array of servers 
5 across a network in a computer environment, comprising the steps of: 

providing a plurality of load balancing servers; 

providing at least one back end Web server; 

wherein one of said load balancing servers is also a scheduler; 

wherein a request packet from a client is routed through said scheduler; 
10 wherein said scheduler routes and load balances said request packet to a 

load balancing server; 

wherein said load balancing server routes and load balances sa,d request 

packet to a back end Web server; 

wherein said back end Web server's response packet to said request 
15 packet is sent to said load balancing server; and 

wherein said load balancing server sends said response packet directly to 

said client. 

2. The process of Claim 1 , wherein said scheduler routes and load balances 
20 client requests to itself. 

3. The process of Claim 1 , further comprising the steps of: 
detecting the failure of said scheduler; and 

electing one of said load balancing servers as the new scheduler. 



25 



4 The process of Claim 1 , wherein said schedu.er detects the failure of other 
load balancing servers; and wherein said scheduler stops routing packets to any 
failed load balancing servers. 

30 5 The process of Claim 1 , wherein said load balancing server schedu.es 
sessions to back end Web servers based on a cookie or session ID. 
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6. The process of Claim 1 , wherein said load balancing server uses cookie 
injection to map a client to a specific back end Web server. 

7. The process of Claim 1, wherein said load balancing server decrypts said 
5 request packet if it is an SSL session before routing and load balancing said 

request packet to a back end Web server. 

8. The process of Claim 7, wherein said load balancing server encrypts said 
response packet if it is an SSL session before sending said response packet 

1 0 directly to said client. 

9. The process of Claim 1 , wherein said load balancing server establishes a 
connection with said client and said client keeps said connection alive with said 
load balancing server. 

15 

10. The process of Claim 9, wherein said load balancing server performs URL 
based scheduling of request packets. 

1 1 . The process of Claim 9, wherein said load balancing server performs hash 
20 scheduling of request packets. 

12. The process of Claim 1, wherein said load balancing server maintains 
persistent connections in all its paths when required; and wherein said load 
balancing server uses hash group based persistence to maintain its persistence 

25 tables. 



13. The process of Claim 1, wherein said load balancing server detects if a 
back end Web server fails; and wherein said load balancing server stops routing 
request packets to failed back end Web servers. 

30 

14. The process of Claim 1 , further comprising the step of: 
providing a content delivery network; and 
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wherein said load baJanolng server modifies select URLs in the HTML 
page in said response packet .0 se^e them from said content deliver network. 

15 . The process of Claim 14, wherein HTML pages that have modified URLs 
5 are cached to improve performance. 

16. An apparatus for routing packets through a load balancing array of servers 
across a network in a computer environment, comprising: 

a plurality of load balancing servers; 
1 0 at least one back end Web server; 

wherein one of said load balancing servers is also a scheduler; 
wherein a request packet from a client is routed through said scheduler; 
wherein said scheduler routes and load balances said request packet to a 

load balancing server; 
15 wherein said load balancing server routes and load balances sa,d request 

packet to a back end Web server; 

wherein said back end Web servers response packet to said request 

packet is sent to said load balancing server; and 

wherein said load balancing server sends said response packet directly to 

20 said client. 

17. The apparatus of Claim 16, wherein said scheduler routes and load 
balances client requests to itself. 

25 1 8. The apparatus of Claim 1 6, further comprising: 

a module for detecting the failure of said scheduler; and 

a module for electing one of said load balancing servers as the new 

scheduler. 

30 19 The apparatus of Claim 16, wherein said scheduler detects the failure of 
other load balancing servers; and wherein said scheduler stops routing packets 
to any failed load balancing servers. 
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20. The apparatus of Claim 16, wherein said load balancing server schedules 
sessions to back end Web servers based on a cookie or session ID. 

21. The apparatus of Claim 16, wherein said load balancing server uses 
5 cookie injection to map a client to a specific back end Web server. 

22. The apparatus of Claim 16, wherein said load balancing server decrypts 
said request packet if it is an SSL session before routing and load balancing said 
request packet to a back end Web server. 



10 



23. The apparatus of Claim 22, wherein said load balancing server encrypts 
said response packet if it is an SSL session before sending said response packet 
directly to said client. 



15 24. The apparatus of Claim 16, wherein said load balancing server 
establishes a connection with said client and said client keeps said connection 
alive with said load balancing server. 



25. The apparatus of Claim 24, wherein said load balancing server performs 
20 URL based scheduling of request packets. 



26. The apparatus of Claim 24, wherein said load balancing server performs 
hash scheduling of request packets. 

25 27. The apparatus of Claim 16, wherein said load balancing server maintains 
persistent connections in all its paths when required; and wherein said load 
balancing server uses hash group based persistence to maintain its persistence 
tables. 



30 28. The apparatus of Claim 16, wherein said load balancing server detects if a 
back end Web server fails; and wherein said load balancing server stops routing 
request packets to failed back end Web servers. 



29 



3NSDOCID; <WO 0169890A1_I_> 



PCT/W^l/07574 

WO 01/69890 

29. The apparatus of Claim 1 6, further comprising: 
a content delivery network; and 

wherein said load balancing server modifies select URLs in the HTML 
page in said response packet to serve them from said content delivery network. 

30. The apparatus of Claim 29, wherein HTML pages that have modified 
URLs are cached to improve performance. 
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